Exploiting textual queries for dynamically visual disambiguation
نویسندگان
چکیده
Due to the high cost of manual annotation, learning directly from web has attracted broad attention. One issue that limits performance current webly supervised models is problem visual polysemy. In this work, we present a novel framework resolves polysemy by dynamically matching candidate text queries with retrieved images. Specifically, our proposed includes three major steps: first discover and then select according keyword-based image search results, employ saliency-guided deep multi-instance (MIL) network remove outliers learn classification for disambiguation. Compared existing methods, approach can figure out right senses, adapt dynamic changes in outliers, jointly models. Extensive experiments ablation studies on CMU-Poly-30 MIT-ISD datasets demonstrate effectiveness approach.
منابع مشابه
Exploiting Visual Perception for Sampling-Based Approximation on Aggregate Queries
Efficient sampling algorithms have been developed for approximating answers to aggregate queries on large data sets. In some formulations of the problem, concentration inequalities (such as Hoeffding’s inequality) are used to estimate the confidence interval for an approximated aggregated value. Samples are usually chosen until the confidence interval is arbitrarily small enough regardless of h...
متن کاملWikimantic: Disambiguation for Short Queries
This paper presents an implemented and evaluated methodology for disambiguating terms in search queries. By exploiting Wikipedia articles and their reference relations, our method is able to disambiguate terms in particularly short queries with few context words. This work is part of a larger project to retrieve information graphics in response to user queries.
متن کاملAutomatic Image Annotation Exploiting Textual and Visual Saliency
Automatic image annotation is an attractive service for users and administrators of online photo sharing websites. In this paper, we propose an image annotation approach exploiting visual and textual saliency. For textual saliency, a concept graph is firstly established based on the association between the labels. Then semantic communities and latent textual saliency are detected; For visual sa...
متن کاملHybrid-LSH for Spatio-Textual Similarity Queries
Locality Sensitive Hashing (LSH) is a popular method for high dimensional indexing and search over large datasets. However, little efforts have put forward to utilizing LSH in mobile applications for processing spatio-textual similarity queries, such as find nearby shopping centers that have a top ranked hair salon. In this paper, we present hybrid-LSH, a new LSH method for indexing data object...
متن کاملTopic Level Disambiguation for Weak Queries
Despite limited success, today’s information retrieval (IR) systems are not intelligent or reliable. IR systems return poor search results when users formulate their information needs into incomplete or ambiguous queries (i.e., weak queries). Therefore, one of the main challenges in modern IR research is to provide consistent results across all queries by improving the performance on weak queri...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Pattern Recognition
سال: 2021
ISSN: ['1873-5142', '0031-3203']
DOI: https://doi.org/10.1016/j.patcog.2020.107620